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Abstract 

We study graph partitioning problems from a min-max perspective, in which an input graph 
on n vertices should be partitioned into k parts, and the objective is to minimize the maximum 
number of edges leaving a single part. The two main versions we consider are where the k 
parts need to be of equal-size, and where they must separate a set of k given terminals. We 
consider a common generalization of these two problems, and design for it an 0{\/\ogn\ogk)- 
approximation algorithm. This improves over an 0(log^ n) approximation for the second version 
due to Svitkina and Tardos | ST04| , and roughly O(fclogn) approximation for the first version 
that follows from other previous work. We also give an improved 0(l)-approximation algorithm 
for graphs that exclude any fixed minor. 

Our algorithm uses a new procedure for solving the Small-Set Expansion problem. In this 
problem, we are given a graph G and the goal is to find a non-empty set S QV oi size IS*! < 
pn with minimum edge-expansion. We give an 0{^\ogn\og (1/p)) bicriteria approximation 
algorithm for the general case of Small-Set Expansion, and 0(1) approximation algorithm for 
graphs that exclude any fixed minor. 

1 Introduction 
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We study graph partitioning problems from a min-max perspective. Typically, graph partitioning 
problems ask for a partitioning of the vertex set of an undirected graph under some problem- 
specific constraints on the different parts, e.g., balanced partitioning or separating terminals, and 
the objective is min-sum, i.e., minimizing the total weight of the edges connecting different parts. 
In the min-max variant of these problems, the goal is different — minimize the weight of the edges 
leaving a single part, taking the maximum over the different parts. A canonical example, that we 
consider throughout the paper, is the Min-Max /c~Partitioning problem: given an undirected graph 
G = {V, E) with nonnegative edge-weights and k > 2, partition the vertices into k (roughly) equal 
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parts Si, . . . , /Sfc so as to minimize maxj 5{Si), where 5{S) denotes the sum of edge-weights in the 
cut {S,V \ S). We design a bicriteria approximation algorithm for this problem. Throughout, let 
w : E ^ M"'" denote the edge- weights and let n = \ V\. 

Min-max partitions arise naturally in many settings. Consider the following application in the 
context of cloud computing, which is a special case of the general graph- mapping problem considered 
in | BLNZlj| (and also implicit in other previous works |YYRC08, ZA06| , CRB09| ]). There are n 



processes communicating with each other, and there are k machines, each having a bandwidth 
capacity C. The goal is to allocate the processes to machines in a way that balances the load 
(roughly n/k processes per machine), and meets the outgoing bandwidth requirement. Viewing the 
processes as vertices and the traffic between them as edge-weights, we get the Min-Max fc-Parti- 
tioning problem. In general, balanced partitioning (either min-sum or min-max) is at the heart of 
many heuristics that are used in a wide range of applications, including VLSI layout, circuit testing 
and simulation, parallel scientific processing, and sparse linear systems. 

Balanced partitioning, particularly in its min-sum version, has been studied extensively during 
the last two decades, with impressive results and connections to several fields of mathematics, see 

The min-max variants 
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contrast, have received much less attention. Previously, no approximation algorithm for the Min- 
Max ^-Partitioning problem was given explicitly, and the approximation that follows from known 
results is not smaller than 0(A;-v/Iogn).[] We improve this dependence on k significantly. 

An important tool in our result above is an approximation algorithm for the Small-Set Expansion 
(SSE) problem. This problem was suggested recently by Raghavendra and Steurer [ RS1(| (see also 
[ [RSTlOa , RSTlObj ) in the context of the unique games conjecture. Recall that the edge- expansion 
of a subset 5 C F with < |5| < \\V\ is 

5{S) 



\S\ 



The input to the SSE problem is an edge- weighted graph and p G (0, and the goal is to compute 



min ^iS). 

\S\<pn 



Raghavendra, Steurer and Tetali |RST10a | designed for SSE an algorithm that approximates the 



expansion within 0{-\J (l/$p) \og{l/ p)) factor of the optimum, while violating the bound on l^l by 
no more than a constant factor (namely, a bicriteria approximation) . Notice that the approximation 
factor depends on ^>p; this is not an issue if every small set expands well, but in general $p can be 
as small as l/poly(n), in which case this guarantee is quite weak. 

One can achieve a true approximation of O(logn) for SSE using ||Rac08(1 , for any value of p.^ 
If one desires a better approximation, then an approximation of 0{yJ\og n) using [ ARVO^ ] can be 
achieved at the price of slightly violating the size constraint, namely a bicriteria approximation 
algorithm. However, unlike the former which works for any value of p, the latter works only for 
p = r2(l). In our context of min-max problems we need the case p = 1/k, where k = k{n) is part 
of the input. Therefore, it is desirable to extend the 0(\/logn) bound of ||ARVO£ ] to a large range 
of values for p. 



^One could reduce t he probl em to the min-sum version of fc-partitioning. The latter admits bicriteria approxi- 



mation 0(\/logn log k) KNS09|, but the reduction loses another factor of fc/2. Another possibility is to repeatedly 
remove n/k vertices from the gra ph, paying again a factor of fc/2 on top of the approximation in a single iteration, 
which is, say, O(logn) by [ElacO^]. 



For very small values of p, roughly pn < 0(log n), a better approximation ratio is known [FKN03| 
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1.1 Main Results 



Our two main results are bicriteria approximation algorithms for the Min-Max ^-Partitioning and 
SSE problems, presented below. The notation Oi;{t) hides multiplicative factors depending on e, 
i.e., stands for 0{f{e) ■ t). 

Theorem 1.1. For every positive constant e > 0, Min-Max k -Partitioning admits a bicriteria 
approximation of [Oe{\/log n log k), 2 + e) . 

This theorem provides a polynomial-time algorithm that with high probability outputs a parti- 
tion 5*1, . . . ,Sk such that maxj \Si\ < (2 + e)^ and maxj 5{Si) < 0(-v/IogrnogA;)OPT, where OPT 
is the optimal min-max value of partitioning into k equal-size parts. (The guarantee on part size 
can be improved slightly to 2 — ^ -|- e). This result is most interesting in the regime 1 ^ A; ^ n. 

Theorem 1.2. For every positive constant e > 0, Small-Set Expansion admits a bicriteria approx- 
imation of (0£(\/lognlog 1 -|- e) . 

This theorem provides a polynomial-time algorithm that with high probability outputs a set S 
of size < l^l < {l+e)pn whose edge-expansion is 5iS)/\S\ = 0(Vlognlog(l/p))OPT, where OPT 
is the minimum edge-expansion over all sets of size at most pn. Our algorithm actually handles a 



more general version, called Weighted Small-Set Expansion, which is required in Theorem 1.1. We 
defer the precise details to Section ^. 

1.2 Additional Results and Extensions 

p— Unbalanced Cut. Closely related to the SSE problem is the following p-Unbalanced Cut 
problem: The input is again a graph G = {V, E) with nonnegative edge-weights and a parameter 
p G (0,^], and the goal is to find a subset S ^ V oi size |5| = pn that minimizes &{S). The 
relationship between this problem and SSE is similar to the one between Balanced Cut and Sparsest 



Cut, and thus Theorem 1.2 yields the following result. 



Theorem 1.3. For every constant < e < 1, the p-Unbalanced Cut problem admits a bicriteria 
approximation of {Og ( y^log n log ( 1 /p) ) , 1 -|- e) . 

This theorem says that there is a polynomial-time algorithm that with high probability finds 
5 C y of size il{pn) < \S\ < (1 + e)pn and value 6{S) < ©^(v^lognlog (l/p))OPT, where OPT 
is the value of an optimal solution to p-Unbalanced Cut. This result generalizes the bound of 



I ARV08 | from p = f^(l) to any value of p E (0, i]. Our factor is better than the O(logre) true 



approximation ratio that follows from | Rac08(| , at the price of slightly violating the size constraint. 



Our algorithm actually handles a more general version, called Weighted p-Unbalanced Cut, which 
is required in Theorem ^T^. We defer the precise details to Section p.4| . 

Min-Max-Multiway-Cut. We also consider the following Min-Max-Multiway-Cut problem, 
suggested by Svitkina and Tardos [ 3T04 1 : the input is an undirected graph with nonnegative edge- 



weights and k terminal vertices ti, . . . ,tk, the goal is to partition the vertices into k parts Si, . . . ,Sk 
(not necessarily balanced), under the constraint that each part contains exactly one terminal, so as 
to minimize maxj(5(S'j). They designed an 0(a log n)-approximation algorithm for this problem, 
where a is the approximation factor known for Minimum Bisection. Plugging a = O(logn), due 
to Racke |Rac08], the algorithm of Svitkina and Tardos achieves 0(log^ n)-approximation. Using 



a similar algorithm to the one in Theorem we obtain a better approximation factor. 
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Theorem 1.4. Min-Max-Multiway-Cut admits an 0{\/log n log k) -approximation algorithm. 



Somewhat surprisingly, we show that removing the dependence on n for Min-Max-Multiway- 
Cut (even though no balance is required) appears hard, which stands in contrast to its min-sum 
version, known as Multiway Cut, which admits 0(l)-approximation [ CKRO0| , KKS^04]. The idea 



is to show that it would imply a similar independence of n for the min-sum version of ^-partitioning, 
thus for large but constant k, we would get an (0(1), 0(l))-bicriteria approximation for Min-Sum 
/^-Partitioning, which seems unlikely based on the current state of art [1ARV08| , |AR06| , |KNS09| . 



Theorem 1.5. // there is a k^~'^ -approximation algorithm for Min-Max-Multiway-Cut for some 
constant e > 0, then there is a {k'^,^) bicriteria approximation algorithm for Min-Sum k-Partitio- 
ning with 7 < 32/^. 

Additionally, we also consider a common generalization of Min-Max /c-Partitioning and Min- 



Max-Multiway-Cut, which we call Min-Max Cut. In fact we obtain Theorem 1.4 as a special case 
of our result for Min-Max Cut. 

Excluded-minor graphs. Finally, we obtain an improved approximation - constant factor - for 
SSE in graphs excluding a fixed minor. 

Theorem 1.6. For every constant e > 0, Small-Set Expansion admits: 

• bicriteria approximation of (Oe(r^), 1 + e) on graphs excluding a Kr^r-fninor. 

• bicriteria approximation of (Og (log g), 1 + e) on graphs of genus g > 1. 

These bounds extend to the /J-Unbalanced Cut problem, and by plugging them into the proof 



of Theorems LI and L4, we achieve an improved approximation ratio of O(r^) for Min-Max k- 



Partitioning and Min-Max-Multiway-Cut in graphs excluding a iCr,r-minor. 
1.3 Techniques 

For clarity, we restrict the discussion here mostly to our main application, Min-Max /c-Partitio- 
ning. Our approach has two main ingredients. First, we reduce the problem to a weighted version 
of SSE, showing that an a (bicriteria) approximation for the latter can be used to achieve 0{a) 
(bicriteria) approximation for Min-Max /c -Partitioning. Second, we design an Oe{\/log n log(l/p)) 
(bicriteria) approximation for weighted SSE (recall that in our applications p = 1/k). 

Let us first examine SSE, and assume for simplicity of presentation that p = 1/k. Note that 
SSE bears obvious similarity to both Balanced Cut and min-sum ^-partition (its solution contains 
a single cut with a size condition, as in Balanced Cut, but the size of this cut is n/k similarly 
to the k pieces in min-sum fc-partition) . Thus, our algorithm is inspired by, but different from, 
the approximation algorithms known for these two problems [ ARVO^ , KNS09|. As in these two 



problems, we use a semidefinite programming (SDP) relaxation to compute an £2 metric on the 
graph vertices. However, new spreading constraints are needed since SSE is highly asymmetric in its 
nature — it contains only a single cut of size n/k. We devise a randomized rounding procedure based 
on the orthogonal separator mechanics, first introduced by Chlamtac, Makarychev, and Makarychev 
|CMM06| in the context of unique games. These ideas lead to an algorithm that computes a cut S 



of expected size IS*! < 0{n/k) and of expected cost 5{S) < 0{^/\ogTl\ogk) times the SDP value. An 
obvious concern is that both properties occur in only expectation and might be badly correlated, 
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e.g., the expected edge-expansion E[(5(5)/|5|] might be extremely large. Nevertheless, we prove 
that with good probability, |5| = 0{n/k) and 5[S)/\S\ is sufficiently small. 

For SSE on excluded-minor and bounded-genus graphs, we give a better approximation guar- 
antees, of a constant factor, by extending the notion of orthogonal separators to linear programs 
(LPs) and designing such low-distortion "LP separators" for these special graph families. The proof 
uses the probabilistic decompositions of Klein, Plotkin, and Rao | KPR93| ] and Lee and Sidiropou- 



los | LS10| . We believe that this result may be of independent interest. Let us note that the LP 



formulation for SSE is not trivial and requires novel spreading constraints. We remark that even 



on planar graphs, the decomposition of Racke Rac08 1 suffers an Q,i\ogn) loss in the approximation 



guarantee, and thus does not yield o(logn) ratio for SSE on this class of graphs. 

Several natural approaches for designing an approximation algorithm for Min-Max fc-Partitio- 



ning fail. First, reducing the problem to trees a la Racke [RacOS] is not very effective, because 
there might not be a single tree in the distribution that preserves all the k cuts simultaneously. 
Standard arguments show that the loss might be a factor of 0{k log n) in the case of k different cuts. 
Second, one can try and formulate a relaxation for the problem. However, the natural linear and 
semidefinite relaxations both have large integrality gaps. As a case study, consider for a moment 
Min-Max-Multiway-Cut. The standard linear relaxation of Calinescu, Karloff and Rabani [ |CKROO|| 
was shown by Svitkina and Tardos |ST04] to have an integrality gap of k/2. In Appendix ^ we 



extend this gap to the semidefinite relaxation that includes all triangle inequality constraints. A 
third attempt is to repeatedly remove from the graph, using SSE, pieces of size Q{n/k). However, 
by removing the "wrong" vertices from the graph, this process might arrive at a subgraph where 
every cut of Q{n/k) vertices has edge- weight greater by a factor of Q{k) than the original optimum 
(see Appendix ^ for details). Thus, a different approach is needed. 

Our approach is to use multiplicative weight-updates on top of the algorithm for weighted SSE. 
This yields a collection S of sets 5, all of size |5| = Q{n/k) and cost 5{S) < O(-v/IogrnogA;)0PT, 
that covers every vertex v £ V at least Q{n/k) times. (Alternatively, this collection S can be viewed 
as a fractional solution to a configuration LP of exponential size.) Next, we randomly sample sets 
Si, . . . ,St from S till V is covered, and derive a partition given by Pi = Si, P2 = Si, and in 
general Pi = Si\ (Uj<i5j). This step is somewhat counter-intuitive, since the sets Pi may have 
very large cost 6 (Pi) (because a set Pi might be a strict subset of a set Si'). We show that the 
total expected boundary of the partition is not very large, i.e., IE[X]j ^{Pi)] ^ 0{k^/]ognTogk)OPT . 
Then, we start fixing the partition by the following local operation: find a Pi violating the constraint 
^(Pi) ^ 0{\/\og n log A;)OPT, replace it with the unique Si containing it, and adjust other sets Pj 
accordingly. Somewhat surprisingly, we prove that this local fixing procedure terminates (quickly). 
Finally, the resulting partition consists of sets Pj, each of which satisfies the necessary properties, 
but now the number of these sets might be very large. So the last step is to merge small sets 
together. We show that this can be done while maintaining simultaneously the constraints on the 
sizes and on the costs of the sets. 

Organization. We first show in Section ^ how to approximate Weighted Small-Set Expansion (in 



both general and excluded-minor graphs). We then show in Section 2.4 that an approximation algo- 
rithm for Weighted Small-Set Expansion also yields one for Weighted p-Unbalanced Cut. In Section 
^ we present an approximation algorithm for Min-Max /c-Partitioning that uses the aforementioned 
algorithm for /O-Unbalanced Cut (and in turn the one for Weighted Small-Set Expansion). The 
common generalization of both Min-Max ^-Partitioning and Min-Max-Multiway-Cut, Min-Max 
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Cut, appears in Section ^ Theorem is proved in Section ^. 



2 Approximation Algorithms for Small Set Expansion 

In this section we design approximation algorithms for the SmaU-Set Expansion problem. Our main 
result is for general graphs and uses an SDP relaxation. It actually holds for a slight generalization 
of the problem, where expansion is measured with respect to vertex weights (see Definition |2.1| and 
Theorem ^]^) . We further obtain improved approximation for certain graph families such as planar 
graphs (see Section |2.3| ) . 

To simplify notation, we shall assume that vertex weights are normalized: we consider measures 
IJ, and 7] with fiiV) = ri{V) = 1. We denote ^(tt) = //({«}) and r]{u) = r]{{u}). We let {V,w) denote 
a complete (undirected) graph on vertex set V with edge-weight 'w{u, v) = w{v, u) > for every 
u ^ V G V. In our context, such {V,w) can easily model a specific edge set E, by simply setting 
w{u,v) = for every non-edge {u,v) ^ E. Recall that we let 5{S) := X]MeSi)6V\s ^(^' ^) 
total weight of edges crossing the cut {S,V \ 5), and further let w{E) denote the total weight of 
all edges. 

Definition 2.1 (Weighted Small-Set Expansion). Let G = {V,w) be a graph with nonnegative 
edge-weights, and let fi and r] be two measures on the vertex set V with fJ,{V) = r]{V) = 1. The 
weighted small set expansion with respect to p £ (0, 1/2] is 

$,,^,,(G) := min I X : r?(5) > 0, p{S) < p} . 

Theorem 2.1 (Approximating SSE). (I) For every fixed £ > 0, there is a polynomial-time algorithm 
that given as input an edge-weighted graph G = {V, w), two measures fi and r] onV (fi{V) = r]{V) = 
1), and some p € (0, 1/2], finds a set S d V satisfying ri{S) > 0, p{S) < (1 + £)p and 

||x^ < «x$,„„(G), (1) 

where D = Oe{\/\-Og n \og{\/ p)). 

(II) When the input contains in addition a parameter H G (0,1), the algorithm finds a non-empty 
set S CV satisfying p{S) < (1 + £)p, ri{S) G [n{H),2{l + £)H], and 

^^^^ x-^— < LI X minj^T^ X — : 7]{S) £ [H,2H], piS) < p] , (2) 



w{E) r]{S) - [wiE) r]{S) 



where D = Oe( y^log n log(max{l/p, l/H})). 



We prove part I of the theorem in Section |2.1| , and part II in Section [2,.2[ These algo- 
rithms require the following notion of m-orthogonal separators due to Chlamtac, Makarychev, 
and Makarychev iCMM06| . 

Definition 2.2 (Orthogonal Separators). Let X be an £2 space (i.e., a collection of vectors sat- 
isfying £2 triangle inequalities). We say that a distribution over subsets of X is an m-orthogonal 
separator of X with distortion D, probability scale a > and separation threshold P < 1 if the 
following conditions hold for S C X chosen according to this distribution: 
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» For all u £ X we have Pr(n £ S) = a 

• For all u,v £ X with \\u — v\\'^ > (3 min(||u|p, Iji'lP), 

o\ ^ min{Pr(n G 5),Pr(u G 5)} 
Pt(u G 5 and v e S) < — —. 

m 

» For all u,v £ X we have Pr(/s(n) ^ Is{v)) < aD x \\u — v\\'^ , where 1$ is the indicator 
function of the set S. 



Theorem 2.2 ( |CMM06 |). There exists a polynomial-time randomized algorithm that given a set 



of vectors X, positive number m, and /? < 1 generates m-orthogonal separator with distortion 
D = 13 {y^log \ X\ log m) and scale a > l/p(|X|) for some polynomial p. 



In the original paper |CMM06], the second requirement in the definition of orthogonal separators 
was slightly different, however, exactly the same algorithm and proof works in our case: If ||n— > 
\uf and \\uf < \\vf, then {u,v) = {\\uf + \\vf - \\u-vf)/2 < ((1 - /3)||nf + ||t;||2)/2 < (1 



l3/2)\\vf. Then, by Lemma 4.1 in 1CMM06|| , {ip{u),ip{v)) < (1-/3/2); hence \\<f{u)-(piv)\\^ > /? > 



and, in Corollary 4.6, \\Tp{u) - i;{v)\\ > 27 = V?/4 > 0. 

2.1 Algorithm I: Small-Set Expansion in General Graphs 

We now prove part I of Theorem ^J] . 

SDP Relaxation. In our relaxation we introduce a vector v for every vertex v £ V. In the 
intended solution of the SDP corresponding to the optimal solution S C V, v = 1 (or, a fixed unit 
vector e), if u G 5"; and v = 0, otherwise. The objective is to minimize the fraction of cut edges 

min — 7— w(u,v) \\u — v\\'^ . 

w(E) ^ V > ; II II 

We could constrain all vectors v to have length at most 1, i.e. < 1, but it turns out our 

algorithm never uses this constraint. We require that the vectors {v : u G V} U {0} satisfy £2 
triangle inequalities i.e., for every tt, u, it; G V, Hu— tt)|p < vlP + Hv— Uttlp < ||m— + 

— "iDp < + Suppose now that we have approximately guessed the measure H of the 

optimal solution H < r]{S) < 2H (this step is not necessary but it simplifies the exposition; in fact, 
we could simply let H = 1, since the SDP is otherwise homogeneous). This can be done since the 
measure of every set S lies in the range from rj{u) to nr](u), where u is the heaviest element in S, 
hence H can be chosen from the set {2*r/(u) : u £ V,t = 0, • • • , [log2 nj } of size 0{n log n). Then 
we add a constraint 

Y,M'v{v) > H. (3) 

We denote ??(u) = 'r]{{u}) and /i(u) = fi{{u}). Finally, we introduce new spreading constraints: for 
every u £ V, 

fi{v) ■ min{||u — \\u\\'^} > {l — p)\\u\\'^. 

(Alternatively, we could use a slightly simpler, almost equivalent constraint (^i, ^^)/i(t') < 

yoll-up. We chose to use the former formulation because an analogous constraint can be written in a 
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linear program, see Section 2.2.) In the intended solution this constraint is satisfied, since if u G S, 
then u = 1 and the sum above equals fi{V \ S) > 1 — p. li u ^ S, then u = and both sides of the 
constraint equal 0. 

The SDP relaxation used in our algorithm is presented below in its entirety. Note that the 
second constraint can be written as {u,v) < and the third constraint can be written as 

{u,v) > 0. 





min 


1 


E 

(u,v)eE 


w{u, v) \\u — 






w{E) 


s.t. 


\\u — w\\'^ + 


\w — w p 


> 


u — -up. 


Vu, v,w £ V, 






\u — -up 


> 


-||2 ||-||2 

u\\ — \\v\\ , 


Vn, V £ V, 




\\u\ 


' + 11^^11' 


> 


u — up. 


Vn, V £ V, 




min{ -u — V 


iMl-f} 


> ( 


^-P)\\n\\\ 


y ueV, 


vev 














Ell* 


\^r]{v) 


> 


H. 






vev 











(4) 



We now describe the approximation algorithm. 



Approximation Algorithm. We first informally describe the main idea behind the algorithm. 
The algorithm solves the SDP relaxation and obtains a set of vectors {u}u£V- Now it samples 
an orthogonal separator, a random set S C V, and returns it. Assume for the moment that 
a = I. Since Pr(u £ S) = we get E[?7(5)] > H. The expected size of the cut is at 

most D X SDP by the third property of orthogonal separators; and thus the sparsity is at most 
D X SDP/H < 2D X OPT. The second property of orthogonal separators guarantees that if n G S", 
then the vectors that are far from u, a very small fraction will belong to S (since the conditional 
probability Fi{v £ S \ u £ S) < 1/m is very small). And by the spreading constraints, at most 
(1 + £)p fraction of vectors (w.r.t. the measure p) is close to u. Hence, the total expected measure 
of S is at most (1 + e)/o + 1/m < (1 + 2e)p. We now proceed to the formal argument. 

We may assume that e is sufficiently small i.e., e £ (0,1/4). The approximation algorithm 
guesses approximate value of the weight H: H < r]{S) < 2H. Set the length of all vectors u with 
r]{u) > 2H to be 0. Solves the SDP and obtains a set of vectors X = {v}y^v Then, it finds an 
orthogonal separator S with m = max(e~^ p~^) and j3 = e. For convenience, we let S be the set 
of vertices corresponding to vectors belonging to the orthogonal separator rather than the vectors 
themselves. The algorithm repeats the previous step [a~^n^] times (recall a is the probabilistic 
scale of the orthogonal separator) and outputs the best S satisfying < p{S) < (l + 10e)p. With an 
exponentially small probability no S satisfies this constraint, in which case, the algorithm outputs 
an arbitrary set satisfying constraints. 



Analysis. We first estimate the probability of the event "ti £ S and p{S) < (1 + 10e)p" for a 
fixed vertex u £V . Let Au = {v : \\u — > /3||it|p} and Bu = {v : \\u — -up < /3||t(|p}. We show 
that only a small fraction of belongs to S*, and that the set is small. 
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From the spreading constraint ^^^^ mindlu — v\\^, \\u\\^)^{v) > (1 — /o)!!^!^, by the Markov 
inequahty, we get that /i(-B„) < p/(l — /3) < (1 + 2e)p. For an arbitrary v ^ Au (for which v 7^ 0) 
write ||n — -up > /3||u|P > /3min(||u|p, By the second property of orthogonal separators, 

Pr(?; ^ S \ u ^ S) < 1/m, thus the expected measure ^{Au n S) is at most E//(A„ n 5) < ep. Now, 
by the Markov inequahty, given that u £ S, the probabihty of the bad event "^(S") > (1 + We)p 
(and, thus fi{Au Ci S) > Sep)" is at most 1/8. Each vertex u £ V belongs to S with probability 
a||n|p. Hence, u £ S, and fJ-{S) < (1 + We)p with probability at least 3/4 all-up. 

Finally, we use the third property of orthogonal separators to bound the size of the cut 5{S) 

E5{S)= \Is{u) - Isiv)\w{u,v) <aD X ^ \\u - vfw{u,v) = aD x SDP x w{E). 

{u,v)^E {u,v)^E 



Here, as usual, SDP denotes the value of the SDP solution; and D = Oe{\/\og n log(l/(5)) is the 
distortion of m-orthogonal separators. 
Define function 

if |5| / 0, p,{S) < (1 + 10e)/9, and f{S) = 0, otherwise. The expectation 



E/(S) > '^ijm_:M_^M^^aH^ 



4-2 

u&V 

The random variable f{S) is always bounded by 2nH, thus with probability at least a/n, 
f{S) > 0. Therefore, with probability exponentially close to 1, after a~^n? iterations, the algorithm 
will find S with f{S) > 0. Since f{S) > 0, we get t]{S) > 0, p{S) < (1 + 10e)p, and 

6(S) 1 SDP 
-^r-T X <4D X . 

w{E) r]{S) - H 

This finishes the proof of part I since SDP/ (2H) < <^p^^^^(G). 



2.2 Algorithm II: Small-Set Expansion in General Graphs 

We now prove part H of Theorem |2.1| . This algorithm uses an SDP relaxation similar to part I, 
although we need a few additional constraints. We write a constraint ensuring that "?/(»S') < 2H" 
(recall H is an approximate value of r]{S) in the optimal solution): we add spreading constraints 
for all u G V, 

min{||-u — -ulp, ||m|P} r]{v) < 2H\\u\\'^, 

and we let m = max{e~^ p^^ , H^^ p~^} . We also require 

E II^IIV(^) < P- (5) 

Algorithm H gets H, the approximate value of the measure r]{S), as input, and thus does not 
need to guess it. 
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Remark 2.3. To handle terminals in the extended version of the problem (see Section^) we guess 
which terminal u £ T belongs to the optimal solution S (if any), and set \\u\\ = 1 and \\v\\ = for 
V € T \ {u}. Since an orthogonal separator never contains the zero vector, we will never choose 
more than one terminal in the set S. 

Approximation Algorithm. The algorithm consists of many iterations of a slightly modified 
Algorithm I. At every step the algorithm obtains a set S of vertices (returned by Algorithm I) and 
adds it to the set T, which is initially empty. Then, the algorithm removes vectors corresponding to 
S from the set X, the SDP solution, and repeats the same procedure till /x(T) > p/4 or r]{T) > H/4:. 
In the end, the algorithm returns the set T if //(T) < p and 'r]{T) < H, and the last set S otherwise. 

The algorithm changes the SDP solution (by removing some vectors), however we can ignore 
these changes, since the objective value of the SDP may only decrease and all constraints but @ 
are still satisfied. Since the total weight ?7(T) of removed vertices is at most H/4, a slightly weaker 
variant of constraint (^) is satisfied. Namely, 

Wnfvin) > 3H/4. (|) 
We now describe the changes in Algorithm I: instead of /, we define function /': 

if l^l / 0, n{S) < (l+10e)/> and r]{S) < {l+We)H and f'{S) = 0, otherwise. Notice, that /' has an 
extra term comparing to / and, in order for f'{S) to be positive, the constraint ri(S) < 2(1 + We)H 
should be satisfied. The new variant of Algorithm I, returns S, once f'{S) > 0. 

The same argument as before shows that for any given u G V conditional on u £ S, fJ,{S) < 
(1 + 10e)p and r]{S) < 2(1 + 10e)H with probability at least 3/4. Then, using a new constraint (|5|), 
we get E/i(S') < ap. Hence, the expectation 

Sax 3/4 H aH aH aH 

Ef (S) > > . 

J > - 4 4 4 - 16 

Again, after at most O(a^^n^) iterations the algorithm will find S with f'{S) > (and only with 
exponentially small probability fail)^. Then, f'{S) > implies 

and 7]{S) >Hx p{S)/{4p). 

The last inequality implies that at every moment r]{T) > H x p,{T)/{4p). Hence, if p{T) > p/4 
(recall, this is one of the two conditions, when the algorithm stops), then r]{T) > H/16. Therefore, 
if the algorithm returns set T, then r]{T) > H/16. If the algorithm returns set S then either 
p{S) > 3/4 p and thus t]{S) > 3H/16 or ri{S) > 3/4 H. 

Both, p{T) and r/(T) are bounded from above by p and H respectively; p{S) and r]{S) are 
bounded from above by (1 + We)p and 2(1 + 10e)H respectively. 

The inequality (Bl) holds for every set S added in T, hence this inequality holds for T. 



'in fact, now f'{S) < 2H, thus we need only 0(q ^n) iterations. 
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2.3 Small-Set Expansion in Minor-Closed Graph Families 



In this subsection we prove Theorem 1.6. We start by writing an LP relaxation. For every vertex 



u €z V we introduce a variable x{u) taking values in [0, 1]; and for every pair of vertices u,v G V we 
introduce a variable z{u,v) = z{v,u) also taking values in [0,1]. In the intended integral solution 
corresponding to a set 5 C V, x{u) = 1 u £ S, and x{u) = otherwise; z{u,v) = \x{u) — x{v)\. 
(One way of thinking of x{u) is as the distance to some imaginary vertex O that never belongs to 
S. In the SDP relaxation vertex O is the origin.) It is instructive to think of x{u) as an analog of 



\u\ 



and of z{u, v) as an analog of 



\u — v\ 



It is easy to verify that LP (|^) below is a relaxation of the Small-Set Expansion problem. It 
has a constraint saying that z{u,v) is a metric (or, strictly speaking, semi-metric). A novelty of 
the LP is in the third constraint, which is a new spreading constraints for ensuring the size of S is 
small. 



min 



w{E) 



\u — v\ 



{u,v)£E 

s.t. z{u,v) + z{v,w) > z{u,w), Vti,f,wGy, 

\x{u) — x{v)\ < z{u,v), Vti,f gV, 

fi{v) ■ minx(n), z{u, v) > {l — p)x{u), Vn G 



x{u),z{u,v) G [0,1], 



Vti, V £ V. 



(7) 



We introduce an analog of m-orthogonal separators for linear programming, which we call LP 
separators. 

Definition 2.4 (LP separator). Let G = iV-iE) he a graph, and let {x{u), z{u,v)}u.v<^v o, set 
of numbers. We say that a distribution over subsets of V is an LP separator of V with distortion 
D > 1, probability scale a > and separation threshold (3 G (0, 1) if the following conditions hold 
for S dV chosen according to this distribution: 

• For all u GV, Pr{u G S) = ax{u). 

• For all u,v GV with z(u,v) > f3mm{x{u),x{v)}, Pr(ti G S and v G S) = 0. 

• For all {u,v) G E, Pr(Is{u) ^ Isi'^)) ^ OiD x z{u,v), where Is is an indicator for the set S. 

Below we present an efficient algorithm for an LP separator: given a graph G = {V, E) excluding 
Kr^r as a minor, a parameter /3 G (0,1), and a set of numbers {x{u), z{u,v)}u,v€V satisfying the 
triangle inequalities described above (but not necessarily the spreading constraints), the algorithm 
computes an LP separator with distortion O(r^) (for genus g graphs the distortion is 0{logg)). 



This proves Theorem 1.6 as follows: by replacing in the algorithms above the SDP relaxation 
with the LP relaxation and the orthogonal separators with LP separators, we obtain O(r^) 
approximation algorithm approximation algorithm for SSE in Kr^r excluded-minor graphs. Com- 
bined with the framework in Section ^ we consequently obtain an 0(r^ )-approximation algorithm 
for Min-Max fc-Partitioning and Min-Max-Multiway-Cut on such graphs. 



11 



Computing LP Separators. We now describe an algoritlim tliat samples an LP separator (see 
Definition 2A) with respect to a feasible solution to LP (0). We recall a standard notion of low- 



diameter decomposition of a metric space, see e.g. [Bar96, GKL03, KR11| and references therein. 

Let {V, d) be a finite metric space. Given a partition P of y and a point v £ V, we refer to the 
elements of P as clusters, and let P{v) denote the cluster S € P that contains v, so v £ S £ V. A 
stochastic decomposition of this metric is a probability distribution v over partitions P of V. 

Definition 2.5 (Separating Decomposition). Let D, A > 0. A stochastic decomposition u of a 
finite metric space {V, d) is called a P'-separating A-bounded decomposition if it satisfies: 

• For every partition P G supp(z^) and every cluster S G P, 

diam(5) := max d{u, v) < A. 

u,v£S 

• For every u,v G V, the probability that a partition P sampled from v separates them is 

Pr [P{u)^P{v)]<D.^. 



Theorem 2.3 ( [|KPR93| , |Rao99| , |FT03|| ). Let G = {V,E) be a graph excluding Kr,r as a minor, 



equipped with nonnegative edge-lengths. Then the graph's shortest-path metric dc admits, for ev- 
ery A > 0, an 0{r'^) -separating A-bounded decomposition. Moreover, there is a polynomial-time 
algorithm that samples a partition from this distribution. 



Lee and Sidiropoulos LS10(| show similarly for graphs with genus g > 1 an 0(log5()-separating 



decomposition. Alternative algorithms for both cases are shown in ||KR11|| . 

Definition 2.6 (Probabilistic Partitioning). Consider a graph G = {V, E) and nonnegative numbers 
{x{u),z{u,v)}u,v&v ■ We say that a distribution u over partitions of V is called a probabilistic 
partitioning with distortion D > and separation threshold (3 > if the following properties hold: 

• For every edge {u,v) £ E with x{u) > 0: 

Pr {P{u) / P{v)) < D ■ z{u,v)/x{u). 

P^v 

• For every u.,v with z{u,v) > f3x{u), we have P{u) ^ P{v) for all P G supp(i^). 

Theorem 2.4 (Separating decomposition implies probabilistic partitioning). Let G = {V,E) be a 
graph that excludes K^^r cls a minor, and let {x{u) , z{u, v)}u,vev satisfy the first two constraints 
of LP (0). Then for every f3 G (0, 1], there is a probabilistic partitioning u with distortion D = 
0{r'^/3~^) and separation threshold /?. 

Proof. We define new lengths y{u,v) = mm{l, z{u,v)/x{u), z{u,v)/x{v)} for all u,v G V; by 
convention, if x{u) = or x{v) = then define y{u,v) = 0. (We remark that a similar approach 
was used in HCKROOH ). These lengths may violate the triangle inequality. Let d : V x V ^ M. he 
the shortest path metric in graph G with edge lengths equal y{u,v) (note: y{u,v) is defined for all 
pairs u,v £ V; however, to obtain d we look only at {u,v) £ E). Clearly, for every edge {u,v) G E, 
d{u,v) < y{u,v). On the other hand, as we show below, for every u,v £V, d{u,v) > ^y{u,v). 
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Claim 2.5. For aUu,v ^ V we have d{u,v) > f). 

Proof. Pick two vertices u,v £ V and consider an arbitrary path u = wi,W2, ■ ■ ■ ,W]y = v. We 
prove that the length of the path (in which the length of each edge {wi,Wi+i) is y{wi,'Wi+i)) is at 
least ^y{u,v). If the length of the path is greater than 2/3 we are done. Thus, we may assume 
that the lengths of all edges are at most 2/3 < 1. We also assume that x{u) > x{v) and thus 
y{u,v) < z{u,v)/x{u). We have, 

7V-1 N-l , . ^ N-l 

^ z{u,v) ^ x{u) y{u,v) 



maxj(x(wi)) maxj(a;(wj)) 

The second inequality holds since z{-,-) is a metric. If x(n)/ maxj(x(tDj)) > 2/3, we are done. 
Assume that for j = argmax x{u)/x{wj) < 2/3. Then, 

N-l N-l , . N-l I / ^ , X| 

E z[Wi,Wi+i) sr^ \x(wi) - x{wi+i)\ 
\x{wj) — x{u)\ + \x{wj) — x{v)\ ^ x{u) \ ^ 2 



> '—-^ ^ '\ \ ' " ^>21 7^ > • 

x{wj) V x{wj)y 3 



This finishes the proof of Claim 2.5. □ 



We now apply the theorem of Klein, Plotkin, and Rao [|KPR95 ] to the metric d[u, v) and obtain 
a probabilistic partition V with A = /3/3 and D' = O(r^). This partition satisfies the following 
properties. 

• If z{u, v) > /3x{u) for u,v £ V, then either x{u) > x{v) and hence y(n, v) > /3, or x{v) > x{u), 
then (using z{u,v) > x{v) — x{u)) 

ziu,v) > ^ix{v) - x{u)) + (1 - ^)z{u,v) > ^ix{v) - x{u)) + (1 - ^)Px{u) > ^x{v). 



Thus, y{u,v) > (3/2 in either case and d{u,v) > 2/?>y{u,v) > (3/3 = A (by Claim |2.5|). 
• For every {u, v) G E, 

Pr(P„^P.)<Z;'^<^^. 

A A x[u) 

The distortion D equals D'/A = 0{r^/(3). □ 

Given a solution for LP (0) (the relaxation for SSE problem), we could proceed as follows: 
Construct a probabilistic partition V with distortion D = 0{r'^(3~^) and some constant separation 
threshold (3 G (0, 1), then pick a random vertex w G V with probability x{w)r]{'w) /^u^i'^)^i'^) 
and, finally, output the cluster P^. However, to highlight the similarity between this LP-algorithm 
and the previous SDP-algorithm (for general graphs), we give an algorithm for constructing LP 
separators, which in turn is used by the Small-Set Expansion algorithm. 
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Theorem 2.6. There exists an algorithm that given a graph G = (V, E) with an excluded minor 
Kr^r, 0- set of numbers {x{u), z{u,v)}u,v<^v satisfying the triangle inequality constraints, and a 
parameter j3 G [0, 1], returns an LP separator S d V with distortion D = r'^j3~^ and separation 
threshold j3. 

Algorithm. The algorithm samples a random partition P with distortion D = 0{r'^f3^^) and 
a separation threshold /3. For every C G P, let 

XooiC) = maxx(u). 
u&C 

The algorithm picks a random set 5 € P with probability Pr(S' = C) = Xoo(C)/n; and with the 
remaining probability 

1 - - y Xoo{C) > 1 - - V x{u) > 0, 
C&P u&/ 

the algorithm sets 5 = 0. 

Now, to guarantee that every vertex u is chosen with probability exactly ax{u)^ where a = 1/n, 
the algorithm removes some elements from S: it picks at random t G [0, 1] and outputs set 

S' = {ueS : x{u) > tXooiS)}. 

Analysis. Verify that S' satisfies the properties of LP separators (with a = 1/n). For every 
ueV, 



Xoo{P{u)) x{u) 



n Xoo{P{u)) 



Vi{u G 5') = Ep Pr(5 = P{u) I P)-Pr(x(u) > tXoo{P{u))) \ P 

Then, if z{u,v) > mm{x{u) , x{v)) , then P{u) ^ P{v) and hence 

Piiu,v G S) := Pr(P(u) = P{v) = S) = 0. 

Finally, 

Pr(n eS',v^ S') < Pt{v ^ S \ u e S') Pt{u G S') + Pr(x(?;) < tx^^iS) \ u e S') Pr(n G S' 
We estimate the first term (using that v has distortion D = 0(r^/?~^); see Definition |2.6| ) 

Pr(n G 5, t; ^ 5) = Pr(f 4S\u(^S)< x B \ / = — z(n, v), 
n n x{u) n 



x[u) 



n 



and then the second term 
Pr(x(v) < tXoc{S) I u G S')Pr{u G S') 



^-^Pr{x{v) < tXooiPu) I Xiu) > tXooiPu),S = Pu) 

n 

^^Pr(t > x{v)/xUPu) I t < x{u)/xooiPu)) 
n 

x{u) min(0, x(u) — x(t>)) z{u,v) 

X ; — ^ 



n 



x{u) 



n 



This completes the proof of Theorem 2.6. 
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2.4 From SSE to p-Unbalanced Cut 

p-Unbalanced Cut and SSE are equivalent, up to some constants, with respect to bicriteria ap- 
proximation guarantees. Indeed, the two problems are related in the same way that Balanced Cut 
and Sparsest Cut are. We refer the reader to |LR99, RSTlOa], and omit details from this version 



of the paper. 

Our intended application of approximating Min-Max /c-Partitioning (in Section ^), requires a 
weighted version of the p-Unbalanced Cut problem, as follows. 

Definition 2.7 (Weighted p-Unbalanced Cut). The input to this problem is a tuple {G,y,w,T,p), 
where G = iViE) is a graph with vertex-weights y : V ^ R+, edge-costs w : E ^ 1R>07 oi'^-d 
parameters T,p G (0, 1]. The goal is to find S of minimum cost 5{S) satisfying: 

1. y{S)>T-y{V); and 

2. \S\ <p-n. 

The unweighted version of the problem (defined in Section |1.2| ) has t = p and unit vertex- 
weights, i.e. y{v) = 1 for all v £ V. We focus on the direction of reducing Weighted p-Unbalanced 
Cut to Weighted Small-Set Expansion, which is needed for our intended application. Formally, we 
have the following corollary of Theorem We use OPT^^j „, .^ ^^ to denote the optimal value of 
the corresponding weighted p-Unbalanced Cut instance. 

Corollary 2.7 (Approximating /o-Unbalanced Cut). For every e > 0, there exists a polynomial-time 
algorithm that given an instance {G, y, w, r, p) of Weighted p-Unbalanced Cut, finds a set S satisfying 
\S\ < l3pn, y{S) > T/7 and 6{S) < a ■ OPT ^G,y,w,T,p) for a = 0£:(v^lognlog(max(l//), 1/r))), 
/3 = 1 + e and 7 = 0(1). 

Proof. Let S* be an optimal solution to {G,y,w,T,p), note that \S*\ < pn, y{S*) > r ■ yiV) and 
6(S*) = OPT^g .r,p) the optimal value of this instance. Define two measures on V as follows. For 
any SCV, set il{sj := \S\/n and r/(5) := y{S)/y{V). 

The algorithm guesses H > t such that H < ri{S*) < 2H (see Algorithm I above for an 
argument why we can guess H). Then it invokes the algorithm from part II on G with measures 
p, and T] as defined above, and parameters p,H. The obtained solution S satisfies \S\ = p{S) ■ n < 
(1 + e)pn and yiS) = r]{S) ■ y(V) > 0^(1) H ■ y{V) > 0^(1) r • y {V), since H > t. Furth ermore, 
6(3) < a • 6{S*) ■ r]{S)/r]{S*) < a • 6(3*) ■ 9^(1), where a = ©^(v'log n log(max(l/p, l/r))). □ 

3 Min-max Balanced Partitioning 

In this section, we present our algorithm for Min-Max ^-Partitioning, assuming a subroutine that 
approximates Weighted p-Unbalanced Cut (which is essentially a rephrasing of Weighted Small-Set 
Expansion). Our algorithm for Min-Max ^-Partitioning follows by a straightforward composition 



of Theorem |3.1| and Theorem 3^ below. Plugging in for (a, /3, 7) the values obtained in Section 



would complete the proof of Theorem |1.1| . 
3.1 Uniform Coverings 

We first consider a covering relaxation of Min-Max /^-Partitioning and solve it using multiplica- 
tive updates. This covering relaxation can alternatively be viewed as a fractional solution to a 
configuration LP of exponential size, as discussed further below. 
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Let C = {S '^V : \S\ < n/k} denote all the vertex-sets that are feasible for a single part. Note 
that a feasible solution in Min-Max /c-Partitioning corresponds to a partition of V into k parts, 
where each part belongs to C. Algorithm |l], described below, uniformly covers V using sets in C 
(actually a slightly larger family than C). It is important to note that its output 5 is a multiset. 

Algorithm 1: Covering Procedure for Min-Max /^-Partitioning: 

Set t = 1, and y^{v) = 1 for all v £ V 

while Y.vev > l/'^ do 

// Solve the following using algorithm from Corollary 2 . 7| . 



Let 5* C y be the solution for Weighted /9-Unbalanced Cut instance {G,y^,w, ^, ^] 
Set 5 = 5U{5*}. 

// Update the weights of the covered vertices, 
for every v £ V do 

|_ Set = ^ • if v e 5*, and y^^^{v) = y^{v) otherwise. 

Set t = t + 1. 
return S 



Theorem 3.1. Running Algorithm |^ on an instance of Min-Max k -Partitioning outputs S that 
satisfies (here OPT denotes the optimal value of the instance): 

1. For alls £S we have 6{S) < a ■ OPT and \S\ < (3 ■ n/k. 

2. For allvGV we have \{S e S : S B v}\/\S\ > l/i5jk). 

Proof. For an iteration t, let us denote y* := X^^gyy*(^)- The first assertion of the theorem is 
immediate from the following claim. 

Claim 3.2. Every iteration t of Algorithm || satisfies 6{S^) < a ■ OPT and |5*| < /3 • n/k. 

Proof. It suffices to show that the optimal value of the Weighted p-Unbalanced Cut instance 
{G, y*, vu, ^, ^) is at most OPT. To see this, consider the optimal solution {S*}^^i of the original 
Min-Max /c-Partitioning instance. We have \S*\ < n/k and w{6{S*)) < OPT for all i G [A;]. Since 
{S*}^^i partitions V, there is some j G [k] with y^{S*) > /k. It now follows that S* is a feasible 
solution to the Weighted p-Unbalanced Cut instance (G, y*, w, ^, ^), with objective value at most 
OPT, which proves the claim. □ 



We proceed to prove the second assertion of Theorem 3.1. Let i denote the number of iterations 
of the while loop, for the given Min-Max ^-Partitioning instance. For any v , let A'^j, denote the 
number of iterations t with 5* 3 v. Then, by the y-updates we have y^^^{v) = 1/2^". Moreover, the 
termination condition implies that y^~^^{v) < 1/n (since < 1/n). Thus we obtain Ny > log2 n 
for all V €z V. From the approximation guarantee of the Weighted p-Unbalanced Cut algorithm, it 
follows that y*(S'*) > :^-Y^ in every iteration i. Thus = y* - i .y*(5'*) < (^1 - ^yvK This 

— ■ = f 1 — 27fc ) " However Y^ > 1/n since the algorithm 

performs ^ iterations. Thus, i < 1 + A'j k ■ Inn < 5j k ■ log2 n. This proves \{S £ S : S B v}\/\S\ = 
Nv/i > (57)~iA:"i. □ 
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Alternative view: A configuration LP. We now describe an alternate approach to finding a 
cover S. Given a bound A on the cost of any single cut, define the set of feasible cuts as follows: 

:fx = {sqv : \s\<^, sis)<x]. 

We define a configuration LP for Min-Max A;-Partitioning as follows. There is a variable xs for 
each S E J-x indicating whether /not cut S is chosen. 



V{X) = min ^ xs 








s.t. Xs > I 








xs>0 


ys G 



The goal is determine the smallest A > such that 'P(A) < k. One can approximately solve this 
using the dual formulation: 



D(A) = max yy 




v&V 




S.t. ^yt;<l 


ys G Tx 






Vv > 


yv€V 



The dual separation oracle can be solved using Weighted Small-Set Expansion; so we can apply 
the Ellipsoid algorithm. Since we only have a multi-criteria approximation for Weighted Small-Set 
Expansion (see Section ||), the details for approximating the configuration LP are rather technical. 

3.2 Aggregation 

The aggregation process, which might be of independent interest, transforms a cover of G into a 
partition. Intuitively, we first let the sets randomly compete with each other over the vertices so as 
to form a partition; then, to make sure no set has large cost, we repeatedly fix the partition locally, 
and use a potential function to track progress. 

Theorem 3.3. Algorithm ^ is a randomized polynomial-time algorithm that when given a graph 
G = (y, E), an e £ (0, 1), and a cover S of V that satisfies: (i) every vertex in V is covered by at 
least c/k fraction of sets S G S, for c G (0, 1]; and (ii) all S £ S satisfy \S\ < 2n/k and 5{S) < B; 
the algorithm outputs a partition V of V into at most k sets such that for all P £ V we have 
\P\ < 2(1 + e)n/k and E[max5(P) : P £ V] < 8B/{ce). 

Analysis. 1. Observe that after step 1 the collection of sets {Pi} is a partition of V and Pi C Si for 
every i. Particularly, |Pj| < \Si\ < 2n/k. Note, however, that the bound S{Pi) < B may be violated 
for some i. We now prove that "^(-f*)] — 2,kB/c. Fix an i < |5| and estimate the expected 

weight of edges E{Pi,[Jj^iPj) given that Si = S. If an edge {u,v) belongs to E{Pi,Uj^iPj) then 
(u,f) G E(Si,V\Si) = E(S,V\S) and both ^ Uj<^iSj. For any edge {u,v) G 6{S) (with u G S, 
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Algorithm 2: Aggregation Procedure for Min-Max /c-Partitfoning: 



1 Sampling 

Sort sets in 5 in a random order: 5i, 5*2, . . . , S^g^. Let Pi = Si\ Uj<^iSj. 

2 Replacing Expanding Sets with Sets from S 

while there is a set Pi such that S{Pi) > 2B do 
|_ Set Pi = Si, and for all j / i, set Pj = Pj \ Si. 

3 Aggregating 

Let B' = max{lZ^6iP),2B}. 

while there are Pi ^ 0, Pj (i j) such that |Pj| + \Pj\ < 2(1 + e)n/k and 
5{Pi) + 5{Pj) < 2B'e'^ do 
|_ Set Pi = PiU Pj and set Pj = 0- 

4 return all non-empty sets Pj. 



V i S), Pr((u,v) E E{Pi,Uj>iPj) \ Si = S) < Pt{v i Uj<iSj \ S^ = S) < {1 - c/ky-\ since v is 
covered by at least c/k fraction of sets in S and is not covered by Si = S. Hence, 



E[w{E{Pi,UjyiPj)) \S, = S]<{1- c/ky''6{S) < (1 - c/ky~'B, 

and E[w{E{Pi,\JjyiPj))] < (1 - c/ky-^B. Therefore, the total expected weight of edges crossing 
the boundary of Pi's is at most Ei^o(l " c/kyB = kB/c, and E[£i S{Pi)] < 2kB/c. 

2. After each iteration of step 2, the following invariant holds: the collection of sets {Pi} is a 
partition of V and Pj C 5j for all i. Particularly, |Pj| < |5j| < 2n/k. The key observation is that 
at every iteration of the "while" loop, the sum ^ - S{Pj) decreases by at least 2P. This is due to 
the following uncrossing argument: 



6{Si) + Y,6{P,\Si) < 6{Si) + ^(s{Pj) + w{E{Pj\Si,Si))-w{E{Si\P„P, 

< S{Si)+(y^6iPj))+w{E{V\Si,Si))-w{E{P,,V\P^) 



= i^m))'-'^^iSy)-2b{Pi) < (j2S{Pj))-2B. 

3 3 

we used that Pj C S'j, all Pj are disjoint, ^j^i{Pj \ Si) C V \ Si, Pi C Si \ Pj, ^j^iPj = V\Pi. 
Hence, the number of iterations of the loop in step 2 is always polynomially bounded and after the 
last iteration E[^^(5(Pj)] < 2kB/c (the expectation is over random choices at step 1; the step 2 
does not use random bits). Hence, E[P'] < 4B/c. 

3. The following analysis holds conditional on any value of B'. After each iteration of step 3, the 
following invariant holds: the collection of sets {Pj} is a partition of V. Moreover, |Pj| < 2(l+e)n/A; 
and 6{Pi) < 2B'£-^ (note: after step 2, 5{Pi) <2B<B' for each i). 

When the loop terminates, we obtain a partition of V into sets Pj satisfying |Pj| < 2(1 + e)n/k, 
^jPj| = n, (5(Pj) < 2B'e^^, Yli^i^i) — ^B' , such that no two sets can be merged without 



violating above constraints. Hence by Lemma 3.4 below (with aj = |Pj| and hi = 5{Piy, the 



number of non-empty sets is at most 2 2(i+^)n/k 2B'e-^ = (1 + ^) + {£/2)k < k. □ 
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Lemma 3.4 (Greedy Aggregation). Let ai,...,at and bi,...bt be two sequences of nonnegative 
numbers satisfying the following constraints ai < A, bi < B, o-i < S and X]i=i h ^ T (for 

some positive real numbers A, B, S, and T). Moreover, assume that for every i and j (i 7^ j) 
either ai + aj > A or bi + bj > B. Then, t < S/A + T/B + max{S/A, T/B, 1). 

Proof. By rescaling we assume that A = 1 and B = 1. Moreover, we may assume that Oi < S 

and Yll=i bi <T hy shghtly decreasing values of ah Oj and bi so that all inequalities still hold. 

We write two linear programs. The first LP (LPj) has variables Xi and constraints Xi + Xj > 1 
for all i, j such that ai + aj > 1. The second LP {LPu) has variables yi and constraints yi + yj >1 
all i,j such that bi + bj > 1. The LP objectives are to minimize X]j ^^"^ minimize YliVi- 
Note, that {ai} is a feasible point for LPj and {bi} is a feasible point for LPjj. Thus, the optimum 
values of LPj and LPjj are strictly less than S and T respectively. 

Observe that both LPs are half-integral. Consider optimal solutions x*, y* where x^jy* £ 
{0, 1/2, 1}. Note that for every i,j either x* + > 1 or + 7/* > 1. Consider several cases, if for 

all i, X* + y* > 1, then t < S + T, since ^*=i(2;* + y*) < S + T. If for some j, x* + y* = (and 
hence x* = y* = 0), then x* + y* > I for i ^ j and, thus, t < S + T + 1. Finally, assume that for 
some j, x* + y* = 1/2, and w.l.o.g. x* = 1/2 and y* = 0. The number of i's with x* / is (strictly) 
bounded by 25. For the remaining i's, x* = and hence y* = 1 (because y* = y* + y* > 1), and 
thus the number of such i's is (strictly) bounded by T. □ 



4 Further Extensions 



Both Theorems 1.1 and 1.4 follow from a more general result for a problem that we call Min-Max 
Cut, defined as follows. The input is an undirected graph G = {V, E), nonnegative edge- weights w, a 
collection of disjoint terminal sets Ti, T2, . . . , C V (possibly empty), and parameters p £ [l/k,l] 
and C, D > 0. The goal is to find a partition Si, . . . , Sk oi V such that: 

1. For all i, Xi C Si] 

2. For all i, \Si\ < pn; 

3. For all i, 5{Si) < C; and 

4. Eim)<D. 

This problem models the aforementioned cloud computing scenario, where in addition, certain 
processes are preassigned to machines (each set Tj maps to machine i G [k]). The goal is to assign 
the processes V to machines [k] while respecting the preassignment and machine load constraints, 
and minimizing both bandwidth per machine and total volume of communication. 

Theorem 4.1. There is a randomized polynomial time algorithm that given any feasible instance 
of the Min-Max Cut problem with parameters k, p, C, D and any e > 0, finds a partition Qi, . . . , 
with the following properties: (i) For all i, Ti C Qi; (ii) For all i, \Qi\ < (2 + e)pn; (Hi) 
E [max*Li<5(Qi)] < 0,(VlognlogA;)C7; and (iv) E Ei'5(Qi)] < OeW\ogn\ogk)D . 



It is clear that in fact Theorem |4.1| generalizes both Theorems 1.1 and |1.4|. Let us now describe 



modifications to the Min-Max ^-Partitioning algorithm used to obtain Theorem 4.1. 
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Uniform Coverings. First, by the introduction of vertex weights, we can shrink each preassigned 
set Tj to a single terminal (for i G [k]). Then, feasible vertex-sets C in the covering procedure 
(Section consist of those S CV where weight(5) < pn (balance constraint) and |S'n{ti} 



< 1 

(preassignment constraint). The subproblem Weighted p-Unbalanced Cut also has the additional 
\S n {ti}^^i\ < 1 constraint; this can be handled in the algorithm from Section ^ by guessing which 
terminal belongs to S (see Remark 2.3). Using Corollary 2.7 we assume an (a, ;8,7) approximation 
algorithm for this (modified) Weighted p-Unbalanced Cut problem; where for any e > 0, a = 
Oei^/logn log(max{l//), 1/r})), /3 = 1 + e and 7 = 0^(1). 

Algorithm below gives the procedure to obtain a uniform covering S bounding total edge-cost 



in addition to the conditions in Theorem 3.1 



Algorithm 3: Covering Procedure for Min-Max Cut: 



set t -(^ 1, y^{v) ^ 1 for ah v eV, and ^ Y^vgV y^i"^)- 
while y* > i do 

for 2 = 0,..., log2 A; -|- 1 do 

solve the Weighted p-Unbalanced Cut instance {G, y*, w, i, p) using the algorithm 



from Corollary 13, to obtain S\i) C V. 

If {5{S\i)) < a ■ mm{C, 4Z)/2*}) then 5* ^ 5*(i) and quit for loop. 

set5^5u{5*}. 
for V € V do 

|_ set y^^^{v) ^ ^ • y*(f ) if -u G 5*, and y*^^{v) ^ y^{v) otherwise, 
set 

set t ^ t + 1. 
output S. 



Theorem 4.2. For any instance of Min~Max Cut, output S of Algorithm ^ satisfies: 

1. 5{S) <a-C and\S\< p -l for all S eS. 

2. \{S :Sb v}\ > log2 n for all v£V. 

3. \S\ < 57 A; • log2 n. 

4- EseS^(S) < IJajlog^n-D. 
Above, for any e > 0, a = Oe{\/\og n log k), j3 = 1 + e and 7 = Oe(l). 
Proof. We start with the following key claim. 

Claim 4.3. In any iteration t of the above algorithm, there exists an i ^ {0, 1, . . . , log2 A: -|- 1} such 
that <5(5*(i)) < a ■ min{C, 4:D/T}, \S\i)\ < ^ ■ I, and y\S\i)) > 

Proof. Consider the optimal solution {Sj}^^^ of the original Min-Max Cut instance. For all j G [A;] 
we have that |5*| < pn, d{S*) < C and S* contains at most one terminal. Moreover, Yl^=i ^i^j) ^ 
D. Since {S*}^^^ partitions V, we also have X^j=i y*(»S'*) = Y*'. Let L C [A;] denote the indices j 
having 6{S*)<^-y\S*). 
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We claim that T^jeLV'^iSj) ^ ^V2- This is because: 



i^i j=i 

Since |L| < /c, there is some q £ L with y^{S*) > Let i G {1, . . . ,log2 + 1} be the value 
such that y^{S*)/Y^ G [i, 21^]; note that such an i exists because y^{S*)/Y* G [2^, 1]. For this i, 
consider the Weighted p-Unbalanced Cut instance (G, y*, tt;, i, p). Observe that S* is a feasible 
solution here since y^{S*) > y*/2*, < and S* contains at most one terminal. Hence the 
optimal value of this instance is at most: 



<5(5*)<min|c, 



< mm < G, — - 



2» 

The first inequality uses the definition of L and that 5{S*) < C, and the second inequality is by 



choice of i. It now follows from Corollary that solution S (i) satisfies the claimed properties. 
We note that a = Ol;{^/]ogrllogk) because each instance of Weighted p-Unbalanced Cut has 
parameters r = i- > | and p > ^- □ 



For each iteration t, let it G {1, . . . ,log2 k + 1} he the index such that S = S (it). Claim 4.3 



implies that such an index always exists, and so the algorithm is indeed well-defined. Condition 1 



of Theorem 4.2 also follows directly. Let £ denote the number of iterations of the while loop, for 
the given Min-Max Cut instance. For any v £ V, let denote the number of iterations t with 
5* B V. Then, by the y-updates we have y^^^{v) = 1/2^". Moreover, the termination condition 
implies that y^~^^{v) < ^ (since y^+i < i). Thus we obtain > log2 n for all v £ V, proving 
condition 2 of Theorem |4.2| . 

Claim |4.3| implies that for each iteration t, we have y*(S'*) > and (5(5*) < 4aD/2**. Since 
it ^ log2 ^ + 1, we obtain: 



,.(5')>„ax{^.^.^(5') 



1. Using y\S^) > ^ in each iteration, Y^+^ = y< - i . y*(5*) < (l - 4^) ■ Y\ This impli 



nes 



that < ~ " = \ ~ 47^/ " however Y^ > ^ since the algorithm 

performs i iterations. Thus, i < 1 + 87 /c • Inn < 97 /c • log2ri. This proves condition 3 in 
Theorem |4^ . 

Using y\S') > ■ 5{S') in each iteration, y*+i = y* - i • y*(5*) < (l - 



-Vy^<expf-^a^l 

» y ~ V 807 y 



-<Y'< U'-} ( 1 - 1 • y^ < exp I - ^'=^"-" ' I • n 



This implies X]t=i '^('S'*) ^ (1607 Inn) • D. Adding in 6{S^) < a ■ C < aD, we obtain 
condition 4 of Theorem [4.2[ 



This completes the proof of Theorem 4.2. □ 
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This step remains essentially the same as in Section |3.2| , namely Algorithm y (with 
= a ■ C). The only difference is that in Step 3 we do not merge parts containing 



Aggregation 

parameter B :■ 

terminals. We first show that this yields a slightly weaker version of Theorem 4.1: in condition (ii) 
we obtain a bound of (3 + e)pn on the cardinality of each part. (Later we show how to achieve the 
cardinality bound of (2 + e)pn as claimed in Theorem [4.1| .) 

Note that each of the final sets {Pi} is a subset of some set in S, and hence contains at most one 
terminal. It also follows that the final sets {Pi} are at most 2k in number: at most k of them contain 
no terminals (just as in Theorem |3.3|) , and at most k contain a terminal (since there are at most k 
terminals). Each of these sets {Pi} has size at most (2 + e)pn and cut value at most 8B/{ce), by 
the analysis in Theorem 4.1. Moreover, if a set Pi contains a terminal then \Pi\ < (5 ■ pn = {l + £)pn 
(since it does not participate in any merge). Finally in order to reduce the number of parts to k, 
we merge arbitrarily each part containing a terminal with one non-terminal part; and output this 
as the final solution. It is clear that each part has at most one terminal, has size < (3 + s)pn, and 
cut value at most 0£(\/lognlog/c) • C. The bound on total cost (condition (iv) in Theorem [4.1D is 
by the following claim. This proves a weaker version of Theorem [4.1| , with size bound (3 + £)pn. 



Claim 4.4. Algorithm 
fyingE\j:l,wi5{Pi)) 



I applied on collection S from Theorem outputs partition {Pj}^^| satis- 
= Os{^/Togrllogk) D. 



Proof. We will show that the random partition {Pi} at the end of Step 1 in Algorithm ^ satisfies 
E[^i'w{5{Pi))] < Olr{^/logn log k)D. This would suffice since '^iW{6{Pi)) does not increase in 
Steps 2 and 3. For notational convenience, we assume (by adding empty sets) that |5| = 67 fcdog2 n 



in Theorem 4.2; note that this does not affect any of the other conditions. 



To bound the cost of the partition {Pi} in Step 1, consider any index i < From the proof 



of Theorem 3.3, we have: 



E[w{E{Pi,UjyiP,)) \Si = S]<{l- c/ky-^6{S), 

where c = ^ is such that each vertex lies in at least c/k fraction of sets S. Deconditioning, 
E[w{E{Pi,Uj>iPj))] < {1-c/ky-^ -EldiSi)] = (l-c/ky-^ • (EseS ' ^^^^^^ we used that 
Si is a uniformly random set from S. So the total edge-cost, 



E 



2.J]EKS(P„U,>.P,))]< [^(1-cA)M.(E<5(5)/|5|) =--5^ 
i \i>o / \ses / ^ ses 



5iS)/\S\. 



Using 5(5") < 17a7log2n-D and |5| = 57 /c-logg n from Theorem |3, E (5(P,)] < D = 

Oe(-v/log n log k) D since a = Oe(\/logn log k) and 1/c = Oe(l). □ 



Obtaining size bound of (2 -|- £)pn. We now describe a modified aggregating step (in place of 
Step 3 in Algorithm |3|) that yields Theorem [4.1| . Given the uniform cover S from Algorithm ^, run 
Steps 1 and 2 of Algorithm |3| (use B = aC) to obtain parts Pi, ... , i-j^j. Then: 

1. Set B' := max {I Z^KPi), '^B}. 

2. While there are Pi, Pj / (i / j) such that \Pi\ + \Pj\ < {I + e)pn, 6{Pi) + S{Pj) < 2B' and 
Pi U Pj does not contain a terminal: replace Pj Pj U Pj and Pj 0. 
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3. Sort the resulting non-empty sets Pi, . . . ,Pt in non-increasing order of size. 

4. Form It/kl groups where the j*'^ group consists of parts indexed between {j — l)k + 1 and jk. 

5. For each i S [k] define Qi as the union of one part from each group such that it contains 
terminal i but no other terminal. Additionally, ensure that each part is assigned to one of 

{Qi}ti- 

We first show that the number of parts after Step 2 above t < Ak. Note that each part contains 
at most one terminal, and the number of parts containing a terminal is at most k. For the non- 
terminal parts, using Lemma |3.4| (with = bi = 6{Pi), a = (1 + e)pn, b = 2B' , S = n and 
T = kB') we obtain a bound of 5fc/2, which implies t < Ak. 

Next observe that sets {Qi}f=i in Step 5 above are well-defined, and can be found using a simple 
greedy rule: this is because each group contains at most k parts. This gives Theorem (i). Since 
t < 4:k there are at most 4 groups and hence max^^-^ 6{Qi) < 4 • max-^-,^ ^{Pi) ^ 8B'. This proves 
Theorem [4.1| (iii). Also, the proof of Claim implies Theorem |4.l| (iv); this is due to the fact 

that in the final partition J2i=i ^{Qi) ^ Y^tli ^{Pi)- 

We now show Theorem |4.l| (ii), i.e. max^^-,^ \Qi\ < (2 + e)pn. Consider any i G [A;] and let P/ 
denote the part assigned to Qi from the first group. By the non-increasing size ordering Pi, . . . ,Pt 
and the round-robin assignment (Step 5) into QjS, we obtain that \Qi\ — < \Qj\ for all j G [k]. 
Taking an average, \Qi\ — < ^ 'l2j=i \Qj\ — I- Finally, since each part {Pe}\^i has size at most 
(1 + e)pn we have \Qi\ < (2 + e)pn. This completes the proof of Theorem |4.1| . 



5 Hardness of Min-Max-Multiway-Cut 

In this section we prove Theorem ^T^, which shows that obtaining a /c^~^-approximation algorithm 
for Min-Max-Multiway-Cut is hard, if not unlikely. This suggests that some dependence on n 
might be necessary (unless we are satisfied with approximation that is linear in k, which is trivial), 
which is in contrast to several cut problems (multiway-cut, multicut, requirement-cut etc) with 
sum-objective where poly{log k) approximation guarantees are known when k is the size of the 
terminal set [|Moi09| , |LM10| , [EGK+10| , |CLLM10| , [MM10|] . Throughout this section, we assume k 



is constant (independent of n), which simplifies the statements; strictly speaking, our reductions 
relate solving one problem with parameter k{n) to solving another problem with parameter k{n'). 

We will refer to the min-sum version of Min-Max ^-Partitioning, called Min-Sum fc-Partitio- 
ning, in which the input is an edge-weighted graph G = {V, E) and a parameter k, and the goal is 
to partition the vertices into k equal-sized parts while minimizing the total edge- weight of all edges 
cut. An algorithm for Min-Sum /c-Partitioning is an [a, /3) bicriteria approximation if for every 
instance, it partitions the vertices into k pieces, each of size at most j3\V\/k, and the total edge- 
weight of all edges cut is at most a times the least possible among all partitions into k equal-size 
sets. 



The basic idea in proving Theorem 1.5 as follows, although there is no vertex-balance require- 
ment in Min-Max-Multiway-Cut, an edge-balance is implicit in the objective. By introducing a 
complete bipartite graph (having suitable edge weight) between the terminals and the rest of the 
graph, this edge-balance can be used to enforce the vertex-balance required in Min-Sum A;-Parti- 
tioning. After this first step, which obtains a bicriteria approximation for Min-Sum /c-Partitioning, 
we do a second step which improves the size- violation in the algorithm for Min-Sum /c-Partitioning. 
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These two steps are formalized in the two foregoing Lemmas 5.1 and 5.3, Putting them together 



immediately proves |1.5 



Lemma 5.1. // there is a p- approximation algorithm for Min-Max-Multiway-Cut then there is a 
{5pk, 10 p)-bicriteria approximation algorithm for MinSum k -Partitioning. 

Proof. Let A denote the ^-approximation algorithm for Min-Max-Multiway-Cut. Consider any 
instance I of Min-Sum /c-Partitioning: graph G = {V, E), edge costs c : E ^ and parameter k. 
For every B > 0, consider an instance J{B) of Min-Max-Multiway-Cut on graph Gb defined as: 

• The vertex set is ^U{^«}iLi where {ti\^^^^ are the terminals. 

• The edges are E[j{{ti,u) : i e [k], u e V}. 

• Extend cost function c by setting c{ti,u) = ^ for all i G [k], u G V . 

Note that every solution to J{B) corresponds to a fc-partition in G (though possibly unbalanced). 
We say that a solution to J{B) is /^-balanced if each piece in the partition has size at most f3 ■ 

The algorithm for Min-Sum /c-Partitioning on I runs algorithm A on all the Min-Max-Multiway- 
Cut instances {j'{2^) : < i < log2 {YleeE '^e)} , and returns the cheapest partition that is (Wp)- 
balanced. We now show that this results in a {5pk, Wp) bicriteria approximation ratio. 

Note that algorithm A must be invoked on J{B) for some value B with B < OPT(X) < 2 ■ B. 
We will show that the partition resulting from this call is the desired bicriteria approximation. 

Claim 5.2. OPJ{J{B)) < OPT(X) + 2B < 50PT(T). 

Proof. Let P* denote the optimal fc-partition to I. Consider the solution to J{B) obtained by 
including each terminal into a distinct piece of P* . The boundary of the piece containing ti (any 
i ^[k]) m. Gb costs at most: 

OPT(X) + - + (A; -!)•-< OPT(X) + 2B, 

n k n 

the term OPT(X) is due to edges in E, the second term is due to edges at ti and the third is due 
to edges at all other {tj : j ^ i}. The claim now follows since B < 2 0PT(X). □ 

Let P denote ^'s solution to J{B), and {Pi}^^i the partition of V induced by P. From 



Claim 5.2, P has objective value (for Min-Max-Multiway-Cut) at most 5/9 0PT(X). Note that the 



boundary (in graph Gb) of tj's piece in P costs at least c{6G{Pi)) + (A; — 1)-^ • |Pj|. Thus we obtain: 

c(6GiPi)) + (k- 1)- • \Pi\ < 5pOPT(X), for ah i £ [k]. 

n 

It follows that for every i G [k], we have: 

L \Pi\ < SpOgLW . _^ < lOp . I since B > OPT(X), and 
2. c{6G{Pi)) < 5p-0PT(X). 

Thus the solution {Pi}i^i to X is (10/9)-balanced and costs at most 5pk ■ OPT(X). This complete 
the proof of Lemma □ 



Lemma 5.3. If there is an (a, k^~'^) -bicriteria approximation algorithm for Min-Sum k-Parti- 
tioning for some constant e > 0, then there is also an {a ■ log log A:, j)-bicriteria approximation 
algorithm with 7 < 3^/^. 
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Proof. Let A denote the (a, A;i-^)-bicriteria approximation algorithm. The idea is to use A recur- 
sively to obtain the claimed (a • log log A:, 7) approximation; details are below. Let G denote the 
input graph with n vertices, and OPTthe optimal balanced /c-partition. The algorithm deals with 
several sub-instances of G, each of which is assigned a level from {0, 1, . . . , t} where t := G(log log k) 
is fixed below. Every level i instance will contain ki = parts and nj = ^-ki vertices. Choose 

t to be the smallest integer such that kt < 3^^^. While generating the sub- instances we also add 
dummy singleton vertices (to keep the instances balanced). For notational simplicity we use the 
same identifier for a sub-instance and the graph corresponding to it. Note that G is the unique level 
instance. For each i G {0, 1, . . . , t — 1}, every level i instance I generates ki level i + 1 instances 
as follows: 

• Run algorithm ^ on X to obtain a kl~^ -halanced partition {Pj : 1 < j < ki} of X. 

• For each j £ [ki], add n^+i — | singleton vertices to I[Pj] to obtain a new level i + 1 instance. 

Note that this process is indeed well-defined since \Pj\ < ^ • k^"^ < ^ • kj = • ^i+i = "-j+i- 

Moreover the total number of level i + 1 instances is n^=o^^ ~ k^^^oi^"'^^'^'^^ < A;^/^. Since each 
instance has size at most n, the total size of all instances is polynomial. 

The algorithm finally returns the partition V corresponding to the set of all level t instances. 
Note that there are at most nt = j ■ kt < 3^/^ • ^ vertices in each level t instance. The algorithm 
ignores all dummy vertices in each piece of V and greedily merges pieces until every piece has at 
least n/k vertices (all from V); let denote the resulting partition of V. Clearly there are at most 
k pieces in 7^', and each has size at most 32/= - f. ThusP' is a 3 '^-balanced fc-partition. 

We now upper bound the total cost of all edges removed by the algorithm (over all instances); 
this also bounds the cost of V\ This is immediate from Claim below: since there are t levels 
and A achieves an a-approximation to the cost, the total cost is bounded by at • OPT. 

Claim 5.4. For each i G {1, . . . the sum of optimal values of level i instances is at most OPT. 

Proof. Consider any fixed i, and an instance I of level i. We will show that the edges of OPTinduced 
on X form a balanced /cj-partition for X. This suffices to prove the claim since the level i instances 
partition V (the original vertex-set). Let V denote the vertices from V in X; note that \V'\ < 
• kjzf . Consider the partition Q of V induced by OPT; note that each piece in Q has size at 
most n/k. Greedily merge pieces in Q as long as the size of each piece is at most n/k, to obtain 
partition Q'; so the number of pieces is: 

IQ'I < 1 + 2\v'\ • - < 1 + 2^ kill • - = 1 + 2kl:i < 3kl:i < j^zf = h. 

n ki^i ^ ^ n * ^ * * 

The second-last inequality uses > 3^/"^ which is true by the choice of t. Now we can fill pieces 
of Q' with dummy singleton vertices of instance X to obtain a balanced fcj-partition of X. □ 



This completes the proof of Lemma 5.3. □ 
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Appendix 

A Integrality Gap for SDP Relaxation of Min-Max-Multiway-Cut 

Consider the following semi-definite relaxation for the min-max multiway cut problem: 
min A 

s.t. A> ^ \\yu,i-yv,i\\l Vi = l,2, (10) 

k 

I|y«,j|l2 = 1 

i=l 

WvuAll = 1 
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VvJ — I \ yu,i \ 1 2 
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\\yu,i yv,j\\2 ' ll?/f J yw,r\\2 — \ \yu,i yw,r\\2 

yu,i ■ yv,j > 
I \ yu,i \ 1 2 — yu,i ■ yv,j 

In the above relaxation, each vertex u £ V is associated with k different vectors: yu,i,yu,2, ■ ■ ■ , yu,k- 
Each of those corresponds to a different terminal. Notice that the last two constraints of the 
relaxation are the £3 triangle inequality constraints including the origin (the same constraints used 
by |CMM06^ ). 

As stated, the integrality gap we consider is the star graph which constrains as single vertex 
u connected by k edges to the k terminals. The value of any integral solution to this instance is 
exactly k — 1 since the only choice is to which terminal should u be assigned. Any choice made 
results in a min-max objective value of k — 1. 

Let us construct the fractional solution to the above relaxation. Fix e to be an arbitrary unit 
vector, and xi,X2, ■ ■ ■ ,Xk be k unit vectors which are all orthogonal to e and the inner product 
between any two of them is —l/{k — 1). Set the following fractional solution: 

yu,i = e Mi = 1,2,... ,k 

yu,j = Vi / j 
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Mi = 1,2, . . . ,k 
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First, we show that the above fractional solution is feasible. Constraints (111]), (O) and (p^ are 
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obviously feasible for all terminals. Vertex u also upholds constraint (^) since: 
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Let us verify that vertex u satisfies also constraint (13): 

Vk^ 



1 , ./k^ 
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Focus on constraint ([l^. It is easy to verify that for all terminals, the sum of all k vectors 
that are associated with the picked vertex is exactly e. Since Yli=i — 0) °f vectors 

associated with u is also e. Therefore, all constraints of type (|l^ are satisfied. 

Regarding the last three constraints, which are the £2 triangle inequality constraints including 
the origin, one can verify the following: 
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All the last three constraints are derived from the above calculations. Hence, we can conclude that 
the fractional solution defined above is feasible for the semi-definite relaxation. 
The value of this solution is: 



max 

l<i<k 



-1 



k-i 



This gives an integrality gap of k/2. 



B Bad Example: Greedy Algorithm for Min— Max /c— Partitioning 

We show that the naive greedy algorithm that repeatedly uses Small-Set Expansion to remove a part 
of size n/k performs very poorly. In this example we even assume that there is an exact algorithm 
for Small-Set Expansion. The graph is a tree on n = k'^ vertices V := {v} Uti^{^i,0) • • • ; and 
edges E = \J^~^{{ui J, Ui J ^i) : 1 < j < A:} U {('"i.o, -"i-i.c) : 2 < i < A; - 1} jj {(w, ni,o)}. 

The simple greedy algorithm will cut out parts having small boundary for k — 1 iterations, 
namely Pi = {itj,i, . . . , Ui^k} for ^ ^ ~ 1] j note that S{Pi) = 1 for all i G [A; — 1]. However the last 
part {v, ui^o, . . . , Uk-ifl} has cut-value Ac — 1; so the resulting objective value is A: — 1. 

On the other hand, it can be checked directly that the optimal value is at most four: Con- 
sider the partition obtained by repeatedly taking the first k consecutive vertices from the ordering 

V, Uifi, . . . , ttl,A,., tt2,0, • • • , U2,k, ■ ■ ■ , Uk-1,0, • • • , Uk-l^k- 
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